Smart Paper Notes

CTooth+: A Large-scale Dental Cone Beam Computed Tomography Dataset and Benchmark for Tooth Volume Segmentation

Weiwei Cui , Yaqi Wang , Yilong Li , Dan Song , Xingyong Zuo , Jiaojiao Wang , Yifan Zhang , Huiyu Zhou , Bung san Chong , Liaoyuan Zeng

Categories: Artificial Intelligence | Computer Vision

2022-08-02

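Accurate tooth volume segmentation is a prerequisite for computer-aided dental analysis. Deep learning-based tooth segmentation methods have achieved satisfactory performance, but they require large amounts of tooth data. Publicly available dental data is limited, which means existing methods cannot be reproduced, evaluated, or applied in clinical practice. In this paper, we establish a 3D dental CBCT dataset, CTooth+, with 22 fully annotated volumes and 146 unlabeled volumes. We further evaluate several state-of-the-art tooth volume segmentation strategies based on fully supervised learning, semi-supervised learning, and active learning, and define the performance principles. This work provides a new benchmark for the tooth volume segmentation task, and the experiments can serve as a baseline for future AI-based dental imaging research and clinical application development.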

CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography Images

Weiwei Cui , Yaqi Wang , Qianni Zhang , Huiyu Zhou , Dan Song , Xingyong Zuo , Gangyong Jia , Liaoyuan Zeng

Categories: Computer Vision | Artificial Intelligence

2022-06-17

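3D tooth segmentation is a prerequisite for computer-aided dental diagnosis and treatment. However, segmenting all tooth regions manually is subjective and time-consuming. Recently, deep learning-based segmentation methods have produced convincing results and reduced manual annotation effort, but they require a large quantity of ground truth for training. To our knowledge, few tooth datasets are available for 3D segmentation research. In this paper, we establish a fully annotated cone beam computed tomography dataset, CTooth, with a tooth gold standard. The dataset contains 22 volumes (7363 slices) with fine tooth labels annotated by experienced radiographic interpreters. To ensure a relatively even data sampling distribution, the dataset includes data variance such as missing teeth and dental restorations. Several state-of-the-art segmentation methods are evaluated on this dataset. We then summarize and apply a series of 3D attention-based UNet variants for segmenting tooth volumes. This work provides a new benchmark for the tooth volume segmentation task. Experimental evidence shows that attention modules in the 3D UNet structure boost responses in tooth regions and suppress the influence of background and noise. The best performance is achieved by 3D UNet with the SKNet attention module, at 88.04% Dice and 78.71% IoU, respectively. The attention-based UNet framework outperforms other state-of-the-art methods on the CTooth dataset. The codebase and dataset have been released.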
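
The Dice and IoU scores quoted in this abstract are standard overlap metrics for segmentation masks. As a minimal sketch, assuming binary voxel masks of identical shape (the function name `dice_and_iou` and the toy volume are illustrative and not taken from the CTooth codebase), they could be computed like this:

```python
import numpy as np


def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks of identical shape."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)

    intersection = np.logical_and(pred, target).sum()
    pred_sum = pred.sum()
    target_sum = target.sum()
    union = pred_sum + target_sum - intersection

    # Assumption: two empty masks count as a perfect match.
    if union == 0:
        return 1.0, 1.0

    dice = 2.0 * intersection / (pred_sum + target_sum)
    iou = intersection / union
    return float(dice), float(iou)


# Toy 2x2x2 volume: 4 predicted voxels, 4 labelled voxels, 3 in common.
pred = np.zeros((2, 2, 2), dtype=int)
pred[0] = 1

target = np.zeros((2, 2, 2), dtype=int)
target[0, 0, :] = 1
target[0, 1, 0] = 1
target[1, 0, 0] = 1

dice, iou = dice_and_iou(pred, target)
print(f"Dice={dice:.2f}, IoU={iou:.2f}")
```

How the paper aggregates these scores across the 22 volumes is not stated in the abstract, so the sketch covers a single volume only.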

Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs

Yusheng Wang , Weiwei Song , Yidong Lou , Fei Huang , Zhiyong Tu , Shimin Zhang

Categories: Robotics

2021-12-25

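Precise, real-time rail vehicle localization and railway environment monitoring are crucial for railroad safety. In this letter, we propose a multi-LiDAR simultaneous localization and mapping (SLAM) system for railway applications. Our approach starts with measurement preprocessing to denoise and synchronize the multiple LiDAR inputs. Different frame-to-frame registration methods are used depending on the LiDAR placement. In addition, we leverage plane constraints from the extracted rail tracks to improve system accuracy. The local map is further aligned with the global map using absolute position measurements. Because metal abrasion and screw loosening are unavoidable, online extrinsic refinement is triggered during operation. The proposed method is extensively validated on datasets gathered over 3000 km. The results show that the proposed system achieves accurate and robust localization together with effective mapping of large-scale environments. Our system has already been applied to a freight railroad for monitoring tasks.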

Rail Vehicle Localization and Mapping with LiDAR-Vision-Inertial-GNSS Fusion

Yusheng Wang , Weiwei Song , Yidong Lou , Yi Zhang , Fei Huang , Zhiyong Tu , Qiangsheng Liang

Categories: Robotics

2021-12-16

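In this paper, we present RailLoMer-V, a global navigation satellite system (GNSS) aided LiDAR-visual-inertial scheme for accurate and robust rail vehicle localization and mapping. RailLoMer-V is formulated on a factor graph and consists of two subsystems: an odometer-assisted LiDAR-inertial system (OLIS) and an odometer-integrated visual-inertial system (OVIS). Both subsystems exploit the typical geometric structure of railroads. Plane constraints from the extracted rail tracks are used to compensate for rotation and vertical errors in OLIS. In addition, line features and vanishing points are leveraged to constrain rotation drift in OVIS. The proposed framework is extensively evaluated on datasets covering 800 km, gathered over more than a year on both general-speed and high-speed railways, by day and by night. Thanks to the tightly coupled integration of all measurements from the individual sensors, our framework remains accurate over long-duration tasks and is robust enough to handle degenerate scenarios (railway tunnels). Moreover, real-time performance can be achieved with an onboard computer.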

RailLoMer: Rail Vehicle Localization and Mapping with LiDAR-IMU-Odometer-GNSS Data Fusion

Yusheng Wang , Yidong Lou , Yi Zhang , Weiwei Song , Fei Huang , Zhiyong Tu , Shimin Zhang

Categories: Robotics

2021-11-30

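In this paper, we present RailLoMer, which achieves real-time, accurate, and robust odometry and mapping for rail vehicles. RailLoMer receives measurements from two LiDARs, an IMU, a train odometer, and a global navigation satellite system (GNSS) receiver. In the front end, the motion estimated from IMU/odometer preintegration de-skews the denoised point clouds and produces an initial guess for frame-to-frame LiDAR odometry. In the back end, a sliding-window factor graph is formulated to jointly optimize the multi-modal information. In addition, we leverage plane constraints from the extracted rail tracks and a structure appearance descriptor to further improve system robustness against repetitive structures. To ensure a globally consistent and less blurry mapping result, we develop a two-stage mapping method that first performs scan-to-map registration at the local scale and then uses GNSS information to register the submaps. The proposed method is extensively evaluated on datasets gathered over a long time range and at multiple scales, and the results show that RailLoMer delivers decimeter-grade localization accuracy even in large or degenerate environments. We also integrate RailLoMer into a prototype interactive train state and railway monitoring system, which has already been deployed on an experimental freight railroad.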

MetroLoc: Metro Vehicle Mapping and Localization with LiDAR-Camera-Inertial Integration

Yusheng Wang , Weiwei Song , Yi Zhang , Fei Huang , Zhiyong Tu , Yidong Lou

Categories: Robotics

2021-11-01

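We propose MetroLoc, an accurate and robust multi-modal sensor fusion framework aimed at one of the most extreme scenarios: large-scale metro vehicle localization and mapping. MetroLoc is built on an IMU-centric state estimator that tightly couples light detection and ranging (LiDAR), visual, and inertial information with the convenience of loosely coupled methods. The proposed framework consists of three submodules: IMU odometry, LiDAR-inertial odometry (LIO), and visual-inertial odometry (VIO). The IMU is treated as the primary sensor and receives observations from LIO and VIO to constrain the accelerometer and gyroscope biases. Compared with previous point-only LIO methods, our approach exploits more geometric information by introducing both line and plane features into motion estimation. The VIO also uses environmental structure information by employing both lines and points. The proposed method has been extensively tested in long-duration metro environments with a maintenance vehicle. Experimental results show that the system is more accurate and robust than state-of-the-art approaches while running in real time. In addition, we develop a series of virtual reality (VR) applications for efficient, economical, and interactive monitoring of rail vehicle state and trackside infrastructure, which have already been deployed on an outdoor test railroad.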

NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis

Xu Ye , Meng Xiao , Zhiyuan Ning , Weiwei Dai , Wenjuan Cui , Yi Du , Yuanchun Zhou

Categories: Natural Language Processing

2022-12-27

With the development of natural language processing (NLP) techniques, automatic diagnosis of eye diseases using ophthalmology electronic medical records (OEMR) has become possible. It aims to evaluate the condition of each of a patient's eyes separately, and we formulate it as a particular multi-label classification task in this paper. Although there are a few related studies on other diseases, automatic diagnosis of eye diseases exhibits unique characteristics. First, descriptions of both eyes are mixed together in OEMR documents, with both free text and templated asymptomatic descriptions, resulting in sparse and cluttered information. Second, OEMR documents contain multiple descriptive parts and are long. Third, it is critical that the disease diagnosis model be explainable. To overcome these challenges, we present an effective automatic eye disease diagnosis framework, NEEDED. In this framework, a preprocessing module is integrated to improve the density and quality of information. Then, we design a hierarchical transformer structure for learning contextualized representations of each sentence in the OEMR document. For the diagnosis part, we propose an attention-based predictor that enables traceable diagnosis by obtaining disease-specific information. Experiments on a real dataset and comparisons with several baseline models show the advantage and explainability of our framework.

Contrastive Learning Reduces Hallucination in Conversations

Weiwei Sun , Zhengliang Shi , Shen Gao , Pengjie Ren , Maarten de Rijke , Zhaochun Ren

Categories: Natural Language Processing | Artificial Intelligence

2022-12-20

Pre-trained language models (LMs) store knowledge in their parameters and can generate informative responses when used in conversational systems. However, LMs suffer from the problem of "hallucination": they may generate plausible-looking statements that are irrelevant or factually incorrect. To address this problem, we propose a contrastive learning scheme named MixCL. A novel mixed contrastive objective is proposed to explicitly optimize the implicit knowledge elicitation process of LMs, and thus reduce their hallucination in conversations. We also examine negative sampling strategies using retrieved hard negatives and model-generated negatives. We conduct experiments on Wizard-of-Wikipedia, a public, open-domain, knowledge-grounded dialogue benchmark, and assess the effectiveness of MixCL. MixCL effectively reduces the hallucination of LMs in conversations and achieves the highest performance among LM-based dialogue agents in terms of relevancy and factuality. We show that MixCL achieves performance comparable to state-of-the-art KB-based approaches while enjoying notable advantages in efficiency and scalability.

CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion

Zizhang Wu , Man Wang , Weiwei Sun , Yuchen Li , Tianhao Xu , Fan Wang , Keke Huang

Categories: Computer Vision

2022-12-13

Channel and spatial attention mechanisms have been shown to provide a clear performance boost for deep convolutional neural networks (CNNs). Most existing methods focus on one of the two or run them in parallel (or in series), neglecting the collaboration between the two attentions. To better establish the feature interaction between the two types of attention, we propose a plug-and-play attention module, which we term "CAT": activating the Collaboration between spatial and channel Attentions based on learned Traits. Specifically, we represent traits as trainable coefficients (i.e., colla-factors) to adaptively combine the contributions of different attention modules so that they better fit different image hierarchies and tasks. Moreover, in addition to the global average pooling (GAP) and global maximum pooling (GMP) operators, we propose global entropy pooling (GEP), an effective component for suppressing noise signals by measuring the information disorder of feature maps. We introduce a three-way pooling operation into the attention modules and apply the adaptive mechanism to fuse their outcomes. Extensive experiments on MS COCO, Pascal-VOC, Cifar-100, and ImageNet show that CAT outperforms existing state-of-the-art attention mechanisms in object detection, instance segmentation, and image classification. The model and code will be released soon.
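
The abstract does not define global entropy pooling, so the following is only a rough sketch of one plausible reading: each channel's spatial activations are normalized with a softmax and summarized by their Shannon entropy, next to the usual average and maximum pooling. The function names `three_way_pool` and `fuse`, and the fixed fusion weights standing in for the learned colla-factors, are assumptions made for illustration and are not the authors' implementation.

```python
import numpy as np


def three_way_pool(feat):
    """Pool a (C, H, W) feature map into three per-channel descriptors."""
    channels = feat.shape[0]
    flat = feat.reshape(channels, -1)

    gap = flat.mean(axis=1)  # global average pooling
    gmp = flat.max(axis=1)   # global maximum pooling

    # Assumed GEP: Shannon entropy of softmax-normalized spatial activations.
    shifted = flat - flat.max(axis=1, keepdims=True)  # for numerical stability
    prob = np.exp(shifted)
    prob /= prob.sum(axis=1, keepdims=True)
    gep = -(prob * np.log(prob + 1e-12)).sum(axis=1)

    return gap, gmp, gep


def fuse(gap, gmp, gep, weights=(1 / 3, 1 / 3, 1 / 3)):
    """Weighted sum of the three descriptors (weights are hypothetical)."""
    w_avg, w_max, w_ent = weights
    return w_avg * gap + w_max * gmp + w_ent * gep


feat = np.ones((2, 4, 4))
feat[1, 0, 0] = 10.0  # channel 1 has a single strong peak

gap, gmp, gep = three_way_pool(feat)
fused = fuse(gap, gmp, gep)
print(gap, gmp, gep, fused)
```

Under this reading, a flat channel reaches the maximum entropy log(H*W), while a channel dominated by one peak scores lower, which is the sense in which the descriptor measures the disorder of a feature map.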

Data Imputation with Iterative Graph Reconstruction

Jiajun Zhong , Weiwei Ye , Ning Gui

Categories: Machine Learning

2022-12-06

Effective data imputation demands rich latent "structure" discovery capabilities from "plain" tabular data. Recent advances in graph neural network-based data imputation solutions show strong structure learning potential by directly translating tabular data into bipartite graphs. However, due to a lack of relations between samples, those solutions treat all samples equally, which goes against one important observation: "similar samples should give more information about missing values." This paper presents a novel Iterative graph Generation and Reconstruction framework for Missing data imputation (IGRM). Instead of treating all samples equally, we introduce the concept of "friend networks" to represent different relations among samples. To generate an accurate friend network with missing data, an end-to-end friend network reconstruction solution is designed to allow continuous friend network optimization during imputation learning. The representation of the optimized friend network is, in turn, used to further optimize the data imputation process with differentiated message passing. Experimental results on eight benchmark datasets show that IGRM yields a 39.13% lower mean absolute error compared with nine baselines and 9.04% lower than the second-best.
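
For readers unfamiliar with how imputation error is usually scored, here is a minimal sketch, assuming the mean absolute error is measured only on entries that were hidden from the model and that "X% lower" means a relative reduction against a baseline. The helper names `masked_mae` and `relative_reduction` and the toy numbers are illustrative and do not come from the IGRM paper.

```python
import numpy as np


def masked_mae(truth, imputed, missing_mask):
    """Mean absolute error over the entries flagged as missing."""
    truth = np.asarray(truth, dtype=float)
    imputed = np.asarray(imputed, dtype=float)
    mask = np.asarray(missing_mask, dtype=bool)
    return float(np.abs(truth[mask] - imputed[mask]).mean())


def relative_reduction(baseline_mae, method_mae):
    """Percentage by which method_mae is lower than baseline_mae."""
    return 100.0 * (baseline_mae - method_mae) / baseline_mae


truth = [[1.0, 2.0], [3.0, 4.0]]
imputed = [[1.0, 2.5], [2.0, 4.0]]
missing = [[False, True], [True, False]]  # two entries were hidden

mae = masked_mae(truth, imputed, missing)
print(mae, relative_reduction(1.0, mae))
```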
